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Abstract 

Kolmogorov's statistic is used for the analysis of properties of pertur- 
bations in the Cosmic Microwave Background signal. We obtain the maps 
of the Kolmogorov stochasticity parameter for W and V band tempera- 
ture data of WMAP which are differently affected by the Galactic disk 
radiation and then we model datasets with various statistic of perturba- 
tions. The analysis shows that the Kolmogorov's parameter can be an 
efficient tool for the separation of Cosmic Microwave Background from 
the contaminating radiations due to their different statistical properties. 

1 Introduction 

The maps of Cosmic Microwave Background (CMB) radiation are among important 
sources of cosmological information. Various descriptors have been applied to study 
the non-Gaussianities in the CMB temperature maps. Kolmogorov stochasticity pa- 
rameter (K-parameter) [1|2|3| when applied to CMB temperature datasets [4|5| reveals 
the Galactic disk, hence, outlining the differences in the statistical properties of the 
disk radiation and that of CMB. The resulting Kolmogorov's map shows enhanced 
randomness, i.e. higher value of K-parameter in certain regions including the Cold 
Spot [6j, the non-Gaussian anomaly in the southern sky. Notably, the K-parameter 
of the Cold Spot satisfies the criterion expected for the voids, namely, increase of the 
randomness towards the walls of the void which act as hyperbolic (deviating) lenses 
for the CMB [F)[8]. 

The present paper is devoted to the analysis of the interplay between the prop- 
erties of datasets, which are crucial at separation of signals of different stochasticity 
(randomness) characteristics. First, we obtain the K-map for W and V bands of 
Wilkinson Microwave Anisotropy Probe (WMAP) data, which indicate the differences 
in the statistic at different contribution of the Galactic radiation. We then estimate the 
K-parameter for datasets of Gaussian, along with a perturbed Gaussian distributions. 
We start with the definition of the Kolmogorov statistic ffl. 

2 Method 

For a finite random sequence of numbers the Kolmogorov's theorem [l] aims to de- 
termine its stochastic (randomness) properties. Consider a sequence of real numbers 
xi, X2, ■ ■ ■ , x n and that x\ < X2 < . . • < x n . Having a continuous distribution function 



F(X) = P{x < X} 



(1) 



where P is the probability of event in brackets. The empirical distribution function is 
defined as 

fO , X < xi ; 
F n (X) = < k/n , Xk<X<Xh+i, k = 1,2, ...,n— 1 ; 

{ 1 , < X . 

Kolmogorov's stochasticity parameter is defined as 

A n = ^sup\F(X) - F n (X)\ (2) 

which according to the theorem is a random number defined by a distribution function 
<I> n (A) = P{\ n < A} that tends to a function <E>(A) uniformly when n — > oo, for any 
continous distribution F(x) 

$„(A) -> $(A). (3) 

where 

*( A )= E (-l) fc e~ 2,i2A2 ,<£>(0)=0. (4) 

k= — oo 

From Figure^ we see that the most probable values of A are between 0.3 and 2.2. 
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Figure 1: Kolmogorov's distribution 3>(A) (a) and it's derivative (f>(X) = ^'(A) 

(b). 

This means that for big enough n and random sequence x n the Kolmogorov's 
stochasticity parameter A n will have a distribution close to $(A) and if the sequence 
is not random, the distribution will be different. This gives us an opportunity to test 
the randomness of a sequence. 

Note, that for each studied sequence we can only get one value of A n , so that it is 
not possible to guess whether that number belongs to a certain distribution. Here we 
have an option to split a sequence into subsets, if the length of the sequence, n, is big 
enough. 

Consider a split of the sequence x n into m subsequences. We can get a new 
sequence A™ of length m, which should be a random sequence with distribution close 
to "^(A). So A™ is our option to proceed on the sequences. 
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3 Numerical Analysis 

The K-maps for the W and V bands of the WMAP 5-year data are given in Fig. 2. 




Figure 2: Kolmogorov maps computed for Wilkinson Microwave Anisotropy 
Probe's W and V band CMB temperature data. 

The visible differences are due to the different role of the Galactic disk contribution, 
i.e. of a perturbation. We will now model such an effect for the K-parameter. Our 
first aim is to find parameters and ranges for their values that will show whether we 
are dealing with stochastic sequence or no. The stochastic sequence can be resulted 
e.g. from a hyperbolic dynamical system [9] but can also have another origin. The 
length of the considered sequences are taken as 4096 which corresponds to the CMB 
maps in given HEALPix format of pixelization. 

We generate sequences of given length with certain properties, namely, we take 400 
different sequences that are divided into 4 following groups each having 100 sequence: 

1st hundred are random sequences with Gaussian distribution; 

2nd hundred are random sequences with perturbed Gaussian distribution; 

3rd hundred are sequences with Gaussian distribution, but they are not random 
but are extracted from a periodic sequence like x n = a n mod 6, where a and b are 
constants; 

4th hundred are sequences including two copies of the same random Gaussian 
sequence. 

Our choice of Gaussian distribution for F(x) is due to its interest for CMB maps, 
but in principle, it could be anything including a uniform distribution. 

So we have groups 1 and 2 that are stochastic and 3 and 4 which are non stochastic, 
and we can proceed with calculations for all sequences. In all figures the horizontal 
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axis i denotes the index of a sequence that runs from 1 to 400, i.e. if 1 < i < 100 we 
have values of the 1st group, if 101 < i < 200 for the 2nd group, and so on. 

Figure [3] shows results of computations. In the|3]a the values of A are shown for 
each sequence, we see correspondence to distribution <&(A) for 1st and 2nd groups, but 
not for 3rd or 4th ones. Figure [3]b contains similar plot for values <3?(A), in this case 
1st and 2nd groups should have almost uniformly distributed values between and 1, 
but not 3rd or 4th, and this is what we see. Figure|3]c shows that values <fi(\)/4> ma x(^) 
are close to 1 in 1st and 2nd case, indicating right distribution again. 

Finally, [|]d, is the % 2 calculated for each sequence's distribution as compared with 
theoretical Gaussian distribution. 




Figure 3: Computed values of A (a), $(A) (b), </>(A)/</> maa; (A) (c) and \ 2 (d). 

One can state that none of these analysis gives precise result/range of values, 
when it comes to a single sequence. The first 3 sequences do not allow to conclude 
whether a single sequence at index i is random or not, and the 4th figure of \ 2 cannot 
really distinguish the 3rd and 4th from the 1st and 2nd, as this is only information on 
distribution, not connected with randomness. 

When we split the sequences and obtain m different values A™ for each sequence, 
we can compute their mean value, A™ (mean). Also we can define empiric distribution 
function <3?„(A) for A™ in similar way to[2j and calculate the distance of that function 
from the theoretical one <&(A) - x 2 ($n(-M, 

The following are the results of calculations with split of each sequence. There are 
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two parameters, | A™ (mean) — A, nean |, where 

\mean = j \<j>(\)d\ « 0.875029, 

and 

X 2 (<MA),$(A)). 

Both parameters show that a sequence is closer to a random one, the closer is their 
value to 0. 

0.35 -i 




Figure 4: The behavior of |A™(mean) - 0.875029| and x 2 ($„(A), $(A)). 

Figure [4] shows that the sequence is stochastic if 

|A™(raean) - 0.875029] < 0.18 

or if 

X 2 ($„(A),$(A)) <0.03, 

and is non-stochastic otherwise. 

These two criteria do not distinguish the 1st and 2nd groups from each other 
since we are dealing with small subsequences that do not "feel" the perturbation of 
distribution, however if n is much bigger, then we could see their difference too. Note 
also that these ranges can change depending on n, with larger values of n the test will 
be more efficient, also there is a chance that they depend on theoretical distribution 
F(x) of sequence x n . This can be solved with consideration of F(x n ) for the sequence 
with a uniform distribution. 

4 Conclusions 

We studied datasets of various statistical properties to reveal the role of perturbations 
in the Cosmic Microwave Background, first of all due to the Galactic disk. We model 
different sequences of perturbed Gaussian distribution to obtain the behavior of the 
Kolmogorov's stochasticity parameter. This revealed the differences in the K-maps 
for CMB W and V bands of WMAP 5-year data, which are differently influenced by 
the contribution of the Galactic disk. Particularly, the Galactic radiation possess high 
value of A and $ around 1, which shows its non-Gaussian nature, while CMB signal is 
basically Gaussian with inhomogeneous perturbations of various degree of randomness. 
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The performed analysis shows that the Kolmogorov's parameter can serve as an 
indicator for separation of the CMB signal from the noise due to their different statis- 
tics. 

We thank Profs. V.G.Gurzadyan and A.A.Kocharyan for discussions and help. 

References 



[i] 

[2] 
[3] 
[4] 
[5 
[6 
[7 
[8 
[9 



A.N. Kolmogorov, G.Ist.Ital.Attuar, 4 (1933) 83. 
V. Arnold, Nonlinearity, 21 (2008) T109. 
V.I. Arnold, ICTP/2008/001, Trieste. 

V.G. Gurzadyan, A. A. Kocharyan, A & A 492, L33 (2008) 

V.G. Gurzadyan, A.E.Allahverdyan et al, 2009, A&A (in press); |arXiv:0811. 2832 



M. Cruz, E. Martinez-Gonzalez, P. Vielva, arXiv:0901.1986 



V.G. Gurzadyan, A. A. Kocharyan, A&A, 493, L61 (2009) 
S. Das, D.N. Spergel, [arXiv:0809.4704| 

V.I. Arnold, Mathematical Methods of Classical Mechanics, Springer- Verlag, 
1989. 



6 



